Online Learning with Gaussian Payoffs and Side Observations
نویسندگان
چکیده
We consider a sequential learning problem with Gaussian payoffs and side observations: after selecting an action i, the learner receives information about the payoff of every action j in the form of Gaussian observations whose mean is the same as the mean payoff, but the variance depends on the pair (i, j) (and may be infinite). The setup allows a more refined information transfer from one action to another than previous partial monitoring setups, including the recently introduced graph-structured feedback case. For the first time in the literature, we provide non-asymptotic problem-dependent lower bounds on the regret of any algorithm, which recover existing asymptotic problem-dependent lower bounds and finitetime minimax lower bounds available in the literature. We also provide algorithms that achieve the problem-dependent lower bound (up to some universal constant factor) or the minimax lower bounds (up to logarithmic factors).
منابع مشابه
Evaluation and Application of the Gaussian-Log Gaussian Spatial Model for Robust Bayesian Prediction of Tehran Air Pollution Data
Air pollution is one of the major problems of Tehran metropolis. Regarding the fact that Tehran is surrounded by Alborz Mountains from three sides, the pollution due to the cars traffic and other polluting means causes the pollutants to be trapped in the city and have no exit without appropriate wind guff. Carbon monoxide (CO) is one of the most important sources of pollution in Tehran air. The...
متن کاملReading English in the Computer Lab
The present study compares the performance of two TEFL reading classes: one taking place in a regular classroom and the other held in a computer lab, with the learners practicing reading online. The results of an independent samples t-test showed that the difference between the learners’ scores on their reading comprehension post-tests and pretests did not differ statistically significantly fro...
متن کاملOnline Learning of Wind-Field Models
We study online approximations to Gaussian process models for spatially distributed systems. We apply our method to the prediction of wind fields over the ocean surface from scatterometer data. Our approach combines a sequential update of a Gaussian approximation to the posterior with a sparse representation that allows to treat problems with a large number of observations.
متن کاملOnline learning of positive and negative prototypes with explanations based on kernel expansion
The issue of classification is still a topic of discussion in many current articles. Most of the models presented in the articles suffer from a lack of explanation for a reason comprehensible to humans. One way to create explainability is to separate the weights of the network into positive and negative parts based on the prototype. The positive part represents the weights of the correct class ...
متن کاملCorrelation between Online Learner Readiness with Psychological Distress related to e-Learning among Nursing and Midwifery Students during COVID-19 pandemic
Introduction: With the sudden shift of face-to-face education to e-learning during the COVID-19 pandemic, awareness of learnerschr('39') readiness for online learning and its impact on studentschr('39') psychological distress related to e-learning is important for teachers, counselors, and educational planners. Therefore, the present study was conducted to investigate the correlation between on...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015